Disk drive partitioning methods and apparatus

ABSTRACT

A storage device has partitions that are separately addressed by distinct IP addresses. This allows direct access of the partitions, on a peer-to-peer basis, by any other device that can communicate using IP. Preferred storage devices support spanning between or among partitions of the same device, as well as between or among different storage devices. Both multicast and proxy spanning are contemplated. Combinations of the inventive storage devices with each other, and with prior art storage devices are contemplated, in all manner of mirroring and other arrangements. In still other aspects of the invention, a given storage device can comprise one or more types of media, including any combination of rotating and non-rotating media, magnetic and optical, and so forth.

This application is a divisional of U.S. Ser. No. 10/473,509 filed on Mar. 25, 2004; which is a national phase of PCT application number PCT/US02/40199 filed on Dec. 16, 2002; which claims priority to provisional application number 60/425,867 filed on Nov. 12, 2002.

FIELD OF THE INVENTION

The field of the invention is data storage devices.

BACKGROUND OF THE INVENTION

There is a trend within the field of electronics to physically (i.e. geographically) disaggregate functionality, and to rely instead on networked resources. Of special interest are resources available over a packet communications network such as the Internet. In addition to the data being transferred, packets include header information such as type of data contained in the packet, i.e. HTML, voice, ASCII, etc., and origination and destination node information. The header information permits error checking, and routing across packet switched networks such as the Internet between devices that may be widely spaced apart. The header information also allows extremely disparate devices to communicate with each other—such as a clock radio to communicate with a computer. Recently published US patent application no. 20020031086, (Welin, Mar. 14, 2002) refers to linking “computers, IP phones, talking toys and home appliances such as refrigerators, microwave ovens, bread machines, blenders, coffee makers, laundry machines, dryers, sweepers, thermostat assemblies, light switches, lamps, fans, drape and window shade motor controls, surveillance equipment, traffic monitoring, clocks, radios, network cameras, televisions, digital telephone answering devices, air conditioners, furnaces and central air conditioning apparatus.”

Communications with storage devices has not kept pace with the trend to disaggregate resources. Disk access has always been under the control of a disk operating system such as DOS, or Microsoft® Windows®. Unfortunately, putting the operating system at the conceptual center of all computing devices has resulted in a dependence on such operating systems, and has tended to produce ever larger and more complicated operating systems. Now that many electronic devices, from personal digital assistants to telephones, digital cameras, and game consoles, are becoming smaller and ever more portable, the dependence on large operating systems has become a liability. One solution is to provide a stripped-down operating system that requires much less overhead. Microsoft® CE® is an example. That solution, however, sacrifices considerable functionality present in the larger systems.

What is needed is a storage device that can be directly accessed by multiple other devices, without the need to go through an operating system.

SUMMARY OF THE INVENTION

In the present invention a storage device has partitions that are separately addressed by distinct IP addresses. This allows direct access of the partitions,.on a peer-to-peer basis, by any other device that can communicate using IP. Many limitations on access to the storage device can thereby be eliminated, including geographical limitations, and the need for a given storage partition to be under the central control of a single operating system.

Preferred storage devices support spanning between or among partitions of the same device, as well as between or among different storage devices. Both multicast and proxy spanning are contemplated.

Combinations of the inventive storage devices with each other, and with prior art storage devices are contemplated, in all manner of mirroring and other arrangements.

In still other aspects of the invention, a given storage device can comprise one or more types of media, including any combination of rotating and non-rotating media, magnetic and optical, and so forth.

Various objects, features, aspects and advantages of the inventive subject matter will become more apparent from the following detailed description of preferred embodiments, along with the accompanying drawing figures.

BRIEF DESDCRIPTION OF THE DRAWING

FIG. 1 is a schematic of a prior art disk drive split into multiple partitions, but where the entire memory is accessed using a single IP address.

FIG. 2 is a schematic of a prior art storage system in which three disk drives are addressed in their entireties using three different IP addresses.

FIG. 3 is a schematic of a storage device having multiple partitions that are separately addressed by different IP addresses.

FIG. 4 is a schematic of a storage device having multiple partitions that are separately addressed by different IP addresses, and some of the partitions are addressed using multiple IP addresses.

FIG. 5 is a schematic of a storage device having multiple partitions comprising different storage media.

FIG. 6 is a schematic of a storage device having multiple partitions, two of which are spanned using multicast spanning.

FIG. 7 is a schematic of a storage device having multiple partitions, two of which are spanned using proxy spanning.

FIG. 8 is a schematic of a storage system in which three storage devices are logically coupled using multicast spanning.

FIG. 9 is a schematic of a storage system in which three storage devices are logically coupled using proxy spanning.

FIG. 10 is a schematic of a storage system in which partitions of a first storage device are mirrored on partitions of one or more additional storage device using multicast mirroring.

DETAILED DESCRIPTION

Prior art FIG. 1 generally depicts a disk drive 10 that is split into multiple partitions 10 _(A), 10 _(B), 10 _(C) . . . 10 _(N). The entire storage area is addressed using a single address IP₁, with individual blocks of data being addressed by a combination of IP₁ and some other information such as partition and offset, or Logical Block Address (LBA). The data is thus always accessed under the control of a disk operating system that provides the additional information. For that reason drive 10 is usually located very close to the processor that runs the operating system, and is usually connected to a hard bus of a computer, RAID or other system.

It is known to format the various partitions 10A . . . 10N differently from one another, under control of different operating systems. However, the entire memory space comprises a single media type, namely rotating magnetic memory, even though there may be some sort of RAM buffer (not shown).

It should be appreciated that the term “IP” is used herein in a broad sense, to include any networking protocol. Thus, an IP address is used as a euphemism for a network address.

Prior art FIG. 2 generally depicts a storage system 20 in which three disk drives 21, 22, 23 are addressed using three different IP addresses, IP1, IP2, and IP3. The drives can have multiple partitions (drive 21 has three partitions 21 _(A), 21 _(B), 21 _(C) (not shown), and drive 23 has two partitions 23 _(A) and 23 _(B) (not shown)), but here again individual blocks of data are addressed using a combination of the IP address, some other information such as partition and offset, or LBA. Drives 21, 22, 23 can be spanned and/or mirrored, but the data on each drive is always accessed using that drive's particular IP address.

In FIG. 3 is a storage device 30 according to the present invention has three partitions 21 _(A), 21 _(B), 21 _(C), which are separately addressed by different IP addresses IP₁, IP₂, IP₃, respectively. Those skilled in the art will appreciate that showing a small plurality of partitions is merely a matter of convenience, in this and other figures, and that storage device 30 could have any practical number of partitions. Similarly, it should be appreciated that depicting storage devices without partitions indicates that such devices have no partitions.

Utilizing IP addresses to route packets directly to and from partitions facilitates the use of very light communication protocols. In particular, the partitions may be directly addressed at the IP level of TCP/IP or UDP/IP stack. It should be appreciated, however, that in order make use of the IP addresses, the storage device 30 (and indeed the various partitions) would need to have sufficient functionality to communicate using IP. That functionality could be designed into the devices (or partitions), or it could be added onto storage devices using an IP adapter 32 (not shown). Indeed, the adapter in such circumstances would essentially be a simple block-to-packet and packet-to-block translator.

Storage device 30 can be connected to any suitable bus by any suitable means. Thus, the operative principles herein can operate across a wide variety of physical buses and protocols, including ATA, ATAPI, SCSI, Fiber CH, PCMCIA, CardBus, and USB. Storage device 30 can also alternatively or additionally operate across a network acting as a virtual IP bus, with the term “IP” being used herein generically with reference to any internetworking protocol that handles packets. It is contemplated, for example, that a user may have a stand-alone storage device that communicates wirelessly with a Local Area Network (LAN), which in turn may be connected to a WAN or to the Internet. Other devices that are also connected to the network (whether in the home, office, or elsewhere) could directly access one or more partitions of the storage device. For example, an IP capable television (not shown) could display images or movies stored on one partition, while a digital camera (not shown) could store/retrieve images on another partition. Still another partition might hold an operating system and office software for use with a laptop, or even an IP capable display and IP capable keyboard and mouse. Printing from any of the partitions might occur on an IP capable printer that is also connected wirelessly, or by hardwire, to the network.

An interesting corollary is that the partitions or other elements can all communicate as peers on a peer-to-peer network. As used herein, the term “element” refers to a hardware unit that is a functional portion of a device, and traditionally communicates with other units of the same device across a bus, without having its own IP address. This can completely eliminate dependence on any particular operating system, and can eliminate operating systems altogether. In addition, many of the elements attached to the network will be dependent on other elements attached to the network to perform tasks that are not within their individual capacities, and will be able to discover, reserve, and release the resources of other peers needed to perform such tasks. Peers will preferably be able to discover the other elements attached to the network, the characteristics of the other elements attached to the network, and possibly the contents of at least some of the elements attached to the network. Such discovery is accomplished without the assistance of a master device, and will preferably involve direct communication between the peer elements.

Preferred networks will be masterless in that all elements have equal access to the network and the other elements attached to the network. The peer elements of the network will preferably communicate with each other utilizing low-level protocols such as those that would equate to those of the transport and lower layers of the OSI model. Preferred embodiments will utilize TCP and UDP IP protocols for communication between elements.

Storage device 30 is preferably able to dynamically create partitions upon receipt of requests from network elements. For example, when a network element requests use of device 30, the network element may provide a unique identifier, possibly a name, to storage device 30, which in turn associates the identifier with any newly created partition. In some instances the network element may also request a particular storage size to be allocated, including all of the remaining storage available on the storage device 30.

In preferred embodiments, the IP addresses for such partitions are obtained from an address server such as a DHCP server upon request from the storage device 30. It is important to note, however, that address allocation devices such as DHCP servers are not masters, since they don't control the network, elements coupled to the network, or the sharing of resources between elements. Assignment of IP addresses to partitions may additionally or alternatively occur during initialization of the device, such as when it is first turned on.

Since storage device 30 may be associated with only a single network interface card (NIC), it is preferred that storage elements be able to obtain multiple IP addresses despite having a single NIC and a single media access control (MAC) address. This can be accomplished by providing a unique partition identifier to an address server when trying to obtain a IP address from the address server. It is contemplated that associating a name provided by an element with any partition created for that element makes it possible to identify each of the partitions of a storage element, despite the fact that IP address associated with each partition may have changed since the partition was created.

Additional details can be found in concurrently filed PCT application No. PCT/US02/40205, entitled “Communication Protocols, Systems and Methods” and PCT application No. PCT/US02/40198, entitled “Electrical Devices With Improved Communication”, the disclosures of which are incorporated herein by reference.

In FIG. 4, storage device 40 is similar to storage device 30 in that it has multiple partitions 41 _(A), 41 _(B), 41 _(C), 41 _(D) that are separately addressed by different IP addresses IP₁, IP₂, IP₃, IP₄, respectively. But here some of the partitions are addressed using multiple IP addresses. In particular, partition 41 _(A) is addressed with IP₁ and IP₅. Partition 41 _(D) is addressed with IP₄, IP₆ and IP₇.

In FIG. 5 a storage device 50 has multiple partitions comprising different storage media. In this particular example there are 2 partitions of rotating media 50 _(A), 50 _(B), one partition of flash memory 50 _(C). All other practical combinations of these and other media are also contemplated. As in FIG. 3, the various partitions are separately addressed by different IP addresses IP₁, IP₂, IP₃, respectively.

In FIG. 6 a storage device 60 has multiple partitions 60 _(A), 60 _(B), 60 _(C), 60 _(D), addressed by IP addresses IP₁, IP₂, IP₃, IP₄, and IP₅ (multicast) respectively. Two of these partitions, 60 _(A) and 60 _(C), are spanned in that partition 60 _(A) extends from logical address a to logical address b, while partition 60 _(C) continues from logical address b+1 to logical address c. The spanned set is thus logical address a to logical address c. The spanning here is multicast spanning, because the partitions share multicast IP₅ which is used to address both partitions 60 _(A) and 60 _(C).

In FIG. 7 a storage device 70 has multiple partitions 70 _(A), 70 _(B), 70 _(C), 70 _(D), addressed by IP addresses IP₁, IP₂, IP₃, IP₈, respectively. (The use of IP₈ here rather than IP₄ is intended to illustrate that the IP addresses need not be consecutive in any manner.) Here again two of the partitions are spanned, 70 _(A) and 70 _(C), in that partition 70 _(A) extends from logical address a to logical address b, while partition 70 _(C) continues from logical address b+1 to logical address c. The spanned set is thus once again logical address a to logical address c. Here, however, we are dealing with proxy spanning as opposed to multicast spanning. IP₁ is used to address partition 70 _(A), while the second part of the spanned data, in partition 70 _(C), is addressed by the IP1 proxy using IP₃. Of course, it is possible to combine multicast spanning and proxy spanning within the same storage device.

In FIG. 8 a storage system 100 has three storage devices 110, 120, and 130 coupled to depict multicast spanning. Device 110 has three partitions 110 _(A), 110 _(B) and 110 _(C), which are separately addressed using IP addresses IP₁, IP₂, and IP₃, respectively. Device 120 has four partitions 120 _(A), 120 _(B), 120 _(C), and 120 _(D), which are separately addressed using IP addresses IP₄, IP₅, IP₆, and IP₇, respectively. Device 130 is not partitioned, which for our purposes is the same as saying that it only has one partition. The entirely of the storage area of device 130 is addressed using IP address IP₈. The spanning in this case is among all three drives. Partition 110C extends from logical address a to logical address b; partition 120D continues from logical address b+1 to logical address c, and the data space of device 130 extends from logical address c+1 to logical address d. The data set extends from logical address a to logical address d.

FIG. 9 is similar to FIG. 8, in that spanning occurs across three drives, and the data set extends from logical address a to logical address d. The main conceptual difference is that the storage devices are logically coupled using proxy spanning rather than multicast spanning. Here, device 210 has three partitions 210 _(A), 210 _(B) and 210 _(C), which are separately addressed using IP addresses IP₁, IP₂, and IP₃, respectively. Device 230 is not partitioned. The entirely of the storage area of device 230 is addressed using IP address IP₄. Device 220 has three partitions, 220 _(A), 220 _(B) and 220 _(C), which are separately addressed using IP addresses IP₄, IP₅, and IP₆, respectively. Partition 210 _(C) extends from logical address a to logical address b; the data space of partition 220 _(C) continues from logical address b+1 to logical address c, and partition 230 extends from logical address c+1 to logical address d.

As elsewhere in this specification, the specific embodiments shown with respect to FIG. 9 are merely examples of possible configurations. A greater or lesser number of storage devices could be utilized, and indeed spanning may be protean, in that devices and/or partitions may be added to or dropped from the spanning over time. There can also be any combination of multicast and proxy spanning across and/or within storage devices, which may have the same or different media. Moreover, the use of IP addresses facilitates physically locating the various storage devices virtually anywhere an IP network can reach, regardless of the relative physical locations among the devices.

In FIG. 10 a storage system 300 provides mirroring of partitions between three different physical storage devices 310, 320 and 330. This could be done by proxy, in a manner analogous to that described above for proxy spanning, or in higher performance systems using multicasting. Thus, partitions in multiple storage devices are addressed using the same IP address. In this particular embodiment, storage device 310 has partitions 310 _(A), 310 _(B), and 310 _(c), addressed using IP addresses IP₁, IP₂, IP₃ and IP9. Storage device 320 has partitions 320 _(A), 320 _(B), and 320 _(C), addressed using IP addresses IP₄, IP₅, IP6 and IP₉. Write requests to IP₃ or IP₉ will result in partition 310 _(C) 320 _(C) and 330 _(C) storing the same data. Read requests to IP₁ address will result in 310 _(C), 320 _(C) and 330 _(C) responding with the same information, with presumably the requester using whichever data arrives first. In the Multicast form it may be prefefred that device 310,320 and 330 listen for the first data returned by any member of the mirrored set, and then remove that request from their request que if another device completes the request before they complete the request.

Communications

In preferred embodiments, communications between a storage element and a non-storage element, will utilize a datagram protocol in which data blocks are atomically mapped to a target device. A datagram sent between elements will preferably comprise command (CMD), logical block address (LBA), data, and token fields, and no more than X additional bytes where X is one of 1, 2, 7, 10, 17, and 30. The data field of such a datagram is preferably sized to be the same as the block size (if applicable) of the element to which the datagram is addressed. As such, an element sending a quantity of data to a storage element where the quantity of data is larger than the block size of the storage element will typically divide the quantity of data into blocks having the same size as the blocks of the storage element, assign LBAs to the blocks, and send each block and LBA pair to the storage element in a datagram.

It is preferred that the datagrams be communicated between elements encapsulating them within addressed packets such as IP packets, and the IP address of the encapsulating packet be used to identify both the element a packet is intended to be sent to, and the partition within the element that the datagram pertains to.

It is preferred that datagram recipients handle datagrams on a first come, first served basis, without reordering packets, and without assembling the contents of the data fields of datagrams into a larger unit of data prior to executing a command identified in the CMD field. As an example, an storage element may receive a datagram containing a block of data, an LBA, and a write command. The storage element, without having to wait for any additional packets, utilizes the IP address of the packet enclosing the datagram to identify the partition to be used, and utilizes the LBA to identify the location within the partition at which the data in the data field is to be written.

Handling the data in individual datagrams as they arrive rather than reassembling the data permits the use of an implied ACK for each command. Using an implied rather than an explicit ACK results in a substantial increase in performance.

Marketing of Storage Devices and Adapters

It is contemplated that once persons in the industry recognize the benefits of having storage devices having partitions that are accessed using their own IP addresses, companies will start producing and/or marketing such devices It is also contemplated that companies will start producing and/or marketing adapters that includes a functionality (hardware or software, or come combination of the two) to permit traditional disk drives, flash memories, and other storage devices to operate in that manner.

Thus, methods falling within the inventive subject matter include manufacturing or selling a disk drive or other storage device in which the partitions can utilize their own IP addresses to execute packet communication with other network elements. Other inventive methods include manufacturing or selling adapters that enable prior art type storage devices to do the same. Indeed it is contemplated that companies will recognize that such adapters are available, and will continue to manufacture or sell prior art type storage devices, knowing (or even advertising) that users can employ such adapters to enable the prior art type storage devices to use in an infringing manner.

Thus, specific embodiments and applications of the inventive storage devices have been disclosed. It should be apparent, however, to those skilled in the art that many more modifications besides those already described are possible without departing from the inventive concepts herein. The inventive subject matter, therefore, is not to be restricted except in the spirit of the appended claims. Moreover, in interpreting both the specification and the claims, all terms should be interpreted in the broadest possible manner consistent with the context. In particular, the terms “comprises” and “comprising” should be interpreted as referring to elements, components, or steps in a non-exclusive manner, indicating that the referenced elements, components, or steps may be present, or utilized, or combined with other elements, components, or steps that are not expressly referenced. 

1. An apparatus comprising: a storage medium; a network interface configured to communicatively couple the apparatus to a network; and a storage element communicatively coupled to the storage medium and the network interface and configured to receive, via the network interface, a request for a partition allocation, the request including a name; to create and allocate a partition of the storage medium based at least in part on the request; to obtain, from a dynamic host configuration protocol (DHCP) server, an internet protocol (IP) address for the partition of the storage medium; and to associate the name with the IP address.
 2. The apparatus of claim 1, wherein the request further includes a requested size of the partition allocation.
 3. The apparatus of claim 1, wherein the request further includes a token and one or more authentication parameters to secure subsequent access to the partition.
 4. The apparatus of claim 1, wherein the storage element is further configured to receive, from a requester via the network interface, a release request; and to release the partition based at least in part on the release request.
 5. The apparatus of claim 4, wherein the requester is a network element.
 6. The apparatus of claim 4, wherein the storage element is configured to release the partition by being further configured to remove any residual data in the partition; to request the DHCP server to release the IP address; and to place the partition into an allocation pool for a future allocation.
 7. The apparatus of 1, wherein the storage element is further configured to receive, from a requester via the network interface, a broadcast name resolution request including a requested name to resolve; to determine that the requested name is the name; and to respond, to the requester via the network interface, with the IP address.
 8. The apparatus of claim 7, wherein the requester is a network element.
 9. The apparatus of claim 1, wherein, prior to receiving the request, the storage element is configured to receive, from a network element via the network interface, a broadcast find request; and to provide the network element with a root node IP address of the storage medium based at least in part on the broadcast find request.
 10. The apparatus of claim 1, wherein the storage element is configured to operate independently of an operating system.
 11. A method comprising: receiving, from a network element via a network interface, a request for a partition allocation, the request including a name; creating and allocating a partition of a storage medium based at least in part on the received request; obtaining, from a dynamic host configuration protocol (DHCP) server, an internet protocol (IP) address for the partition of the storage medium; and associating the name with the IP address.
 12. The method of claim 11, further comprising: securing access to the allocated partition based at least in part on a token and one or more authentication parameters received in the request.
 13. The method of claim 11, further comprising: receiving, from a requester via the network interface, a release request; and releasing the partition based at least in part on the release request.
 14. The method of claim 13, wherein said releasing the partition comprises: removing any residual data in the partition; requesting the DHCP server to release the IP address; and placing the partition into an allocation pool for a future allocation.
 15. The method of claim 14, further comprising: receiving, from the requester via the network interface, a broadcast name resolution request including a requested name to resolve; determining that the requested name is the name; and responding, to the requester via the network interface, with the IP address.
 16. An apparatus comprising: a network interface; and a network element communicatively coupled to the network interface and configured to transmit, to a storage element via the network interface, a request for a partition of a storage medium associated with the storage element to be created and allocated to the network element, the request including a name to be used to identify a partition created in reponse to the request; and to receive, from the storage element via the network interface, an internet protocol (IP) address to associate with the name.
 17. The apparatus of claim 16, wherein the network interface is further configured to transmit, prior to transmitting the request, a broadcast message including the name to determine that the name is unique to all recipients of the broadcast message.
 18. The apparatus of claim 16, wherein the request further includes a token and the network element is further configured to provide the token to another network element to enable the another network element to access a partition created in response to the request.
 19. The apparatus of claim 16, wherein the network element is further configured to transmit, to the storage element via the network interface, a resource availability request; and to receive, from the storage element via the network interface, a resource availability response indicating an amount of storage available on the storage medium.
 20. An apparatus comprising: one or more storage media; and a storage element communicatively coupled to the one or more storage media and configured to partition the one or more storage media into a plurality of partitions; and to obtain, from a dynamic host configuration protocol (DHCP) server, a unique internet protocol (IP) address for each partition of the plurality of partitions, and a multicast IP address for at least two partitions of the plurality of partitions.
 21. The apparatus of claim 20, wherein the at least two partitions include a spanned set of logical addresses.
 22. The apparatus of claim 20, wherein the storage element is further configured to receive one or more requests from at least one network element and to partition the one or more storage media and to obtain the unique IP addresses based at least in part on the one or more requests.
 23. An apparatus comprising: a storage medium; and a storage element communicatively coupled to the storage medium and configured to partition the storage medium into a plurality of partitions; and to obtain, from a dynamic host configuration protocol (DHCP) server, a first unique internet protocol (IP) address for a first partition of the plurality of partitions and a second unique IP address for the first partition.
 24. The apparatus of claim 23, wherein the second unique IP address is associated with a logical address that spans the first partition and one or more other partitions.
 25. The apparatus of claim 23, wherein the storage element is further configured to receive one or more requests from at least one network element and to partition the storage medium and to obtain the first unique IP address and the second unique IP address based at least in part on the one or more requests. 